Interpolating Conditional Density Trees
نویسندگان
چکیده
Joint distributions over many variables are frequently modeled by decomposing them into products of simpler, lower-dimensional conditional distributions, such as in sparsely connected Bayesian networks. However, au tomatically learning such models can be very computationally expensive when there are many datapoints and many continuous vari ables with complex nonlinear relationships, particularly when no good ways of decom posing the joint distribution are known a pri ori. In such situations, previous research has generally focused on the use of discretization techniques in which each continuous vari able has a single discretization that is used throughout the entire network. In this paper, we present and compare a wide variety of tree-based algorithms for learning and evaluating conditional density estimates over continuous variables. These trees can be thought of as discretizations that vary ac cording to the particular interactions being modeled; however, the density within a given leaf of the tree need not be assumed con stant, and we show that such nonuniform leaf densities lead to more accurate density esti mation. We have developed Bayesian net work structure-learning algorithms that em ploy these tree-based conditional density rep resentations, and we show that they can be used to practically learn complex joint prob ability models over dozens of continuous vari ables from thousands of data points. We focus on finding models that are simultaneously ac curate, fast to learn, and fast to evaluate once they are learned.
منابع مشابه
تأثیر الگوی پراکنش درختان بر برآورد تراکم با روش نمونه برداری نزدیکترین فرد: مطالعات موردی در درختزارهای بنه زاگرس و تودههای شبیه سازی شده
Distance methods and their estimators of density may have biased measurements unless the studied stand of trees has a random spatial pattern. This study aimed at assessing the effect of spatial arrangement of wild pistachio trees on the results of density estimation by using the nearest individual sampling method in Zagros woodlands, Iran, and applying a correction factor based on the spatial p...
متن کاملON THE STATIONARY PROBABILITY DENSITY FUNCTION OF BILINEAR TIME SERIES MODELS: A NUMERICAL APPROACH
In this paper, we show that the Chapman-Kolmogorov formula could be used as a recursive formula for computing the m-step-ahead conditional density of a Markov bilinear model. The stationary marginal probability density function of the model may be approximated by the m-step-ahead conditional density for sufficiently large m.
متن کاملTree Representations and the Usability of Interpolating Controllers
Interpolation is ubiquitous as a user interface method: it is here investigated for its usability using both numerical and experimental methods. Three properties—consistency, locality, and monotonicity—are stated to be required for interpolating controllers to be usable. These properties are defined quantitatively. An existing linear representation for tree structures is shown to have relativel...
متن کاملBagging classifiers based on Kernel density estimators
A lot of research is being conducted on combining classification rules (classifiers) to produce a single one, known as an ensemble, which in general is more accurate than the individual classifiers making up the ensemble. Two popular methods for creating ensembles are Bagging introduced by Breiman, (1996) and, AdaBoosting by Freund and Schapire (1996). These methods rely on resampling technique...
متن کاملRecursive partitioning and Bayesian inference on conditional distributions
In this work we introduce a Bayesian framework for nonparametric inference on conditional distributions in the form of a prior called the conditional optional Pólya tree. The prior is constructed based on a two-stage nested procedure, which in the first stage recursively partitions the predictor space, and then in the second generates the conditional distribution on those predictor blocks using...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002